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(57) Abstract 

The present invention relates to a method for in vitro evolution 
of protein function. In panicular, the method relates to the shuffling 
of nucleotide segments obtained from exonuclease digestion. The 
present inventors have shown that polynucleotide fragments derived 
from a parent polynucleotide sequence digested with an exonuclease 
can be combined to generate a polynucleotide sequence which 
encodes for a polypeptide having desired characteristics. This 
method may be usefully applied to the generation of new antibodies 
or parts thereof having modified characteristics as compared to the 
parent antibody. 
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A METHOD FOR IN VITRO MOLECUIAR EVOLUTION 
OF PROTEIN FUNCTION 

Field of the invention 

The present invention relates to a method for m vxtrc 
molecular evolution of protein function, m particulai" by 
shuffling of DNA segments obtained using an exonuclease. 

Backgroiind of the invention 

Protein function can be modified and improved in vitro 
by a variety of methods, including site directed 
mutagenesis (Alber et al , Nature, 5; 33 0 ( 6 143 ) : 4 1 - 4 6 , 1387) 
combinatorial cloning (Huse et al , Science, 246:1275-12 81, 
1989 ; Marks et al , Biotechnology, 10: 779-783, 1992) and 
random mutagenesis combined with appropriate selection 
systems (Barbas et al , PNAS . USA, 89: 4457-4461, 1992). 

The method of random mutagenesis together with 
selection has been used in a number of cases to improve 
protein function and two different strategies exist. 
Firstly, randomisation of the entire gene sequence in 
combination with the selection of a variant (mutant) 
protein with the desired characteristics, followed by a new 
round of random mutagenesis and selection. This method can 
then be repeated until a protein variant is found which is 
considered optimal (Schier R. et al , J. Mol . Biol. 1996 263 
(4) : 551-567) . Here, the traditional route to introduce 
mutations is by error prone PGR (Leung et al , Technique, 1: 
11-15, 1989) with a mutation rate of -0.7%. Secondly, 
defined regions of the gene can be mutagenized with 
degenerate primers, which allows for mutation rates up to 
100% (Griffiths et al , EMBO . J, 13: 3245-3260, 1994; Yang 
et al, J. Mol. Biol. 254: 392-403, 1995). The higher the 
mutation rate used, the more limited the region of the gene 
that can be subjected to mutations. 

Random mutation has been used extensively m the field 
of antibodv engineering. In vivo formed antibody genes can 
be cloned m vitro (Larrick et al , Biochem. Biophys . Res. 
Com.mun. 160: 1250-1256, 1989) and random combinations of 
^rj^o Qenes encodma the variable hea\^' and light genes can 



wo 98/58080 



PCT/GB98/01757 



2 

be subjected to selection (Marks et al , Biotechnology, 10: 
779-783, 1992). Functional antibody fx'agments selected can 
be further improved using random mutagenesis and additional 
rounds of selections (Schier R. et al , J. Mol . Biol. 1996 
263 (4) : 551-567) . 

The strategy of random mutagenesis is followed by 
selection. Variants with interesting characteristics can 
be selected and the mutated DNA regions from different 
variants, each with interesting characteristics, are 
combined into one coding sequence (Yang et al , J. Mol. 
Biol. 254: 392-403, 1995). This is a multi-step sequential 
process, and potential synergistic effects of different 
mutations in different regions can be lost, since they are 
not subjected to selection in combination. Thus, these two 
strategies do not include simultaneous mutagenesis of 
defined regions and selection of a combination of these 
regions . Another process involves combinatorial pairing of 
genes which can be used to improve eg antibody affinity 
(Marks et al. Biotechnology, 10: 779-783, 1992) . Here, the 
three CDR- regions in each variable gene are fixed and this 
technology does not allow for shuffling of individual gene 
segments in the gene for the variable domain, for example, 
including the CDR regions, between clones. 

The concept of DNA shuffling (Stemmer, Nature 370: 
389-391, 1994) utilizes random fragmentation of DNA and 
assembly of fragments into a functional coding sequence. 
In this process it is possible to introduce chemically 
synthesized DNA sequences and in this way target variation 
to defined places in the gene which DNA sequence is known 
(Crameri et al , Biotechniques , 18: 194-196, 1995). In 
theory, it is also possible to shuffle DNA between any 
clones. However, if the resulting shuffled gene is to be 
functional with respect to expression and activity, the 
clones to be shuffled have to be related or even identical 
with the exception of a low level of random mutations. DNA 
shuffling between genetically different clones will 
generally produce non- functional genes. 
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Selection of functional proteins from molecular 
libraries has been revolutionized by the development of the 
phage display technology (Parmley et al , Gene, 73: 305-391 
1988; McCafferty et al , Nature, 348: 552-554, 1990; Barbas 
et al, PNAS. USA, 88: 7978-7982, 1991). Here, the phenotype 
(protein) is directly linked to its corresponding genotype 
(DNA) and this allows for directly cloning of the genetic 
material which can then be subjected to further 
modifications in order to improve protein function. Phage 
display has been used to clone functional binders from a 
variety of molecular libraries with up to 10^^ transf ormant s 
m size (Griffiths et al , EMBO . J. 13: 3245-3260, 1994). 
Thus, phage display can be used to directly clone 
functional binders from molecular libraries, and can also 
be used to improve further the clones originally selected. 

Random combination of DNA from different mutated 
clones in combination with selection of desired function is 
a more efficient way to search through sequence space as 
compared to sequential selection and combination of 
selected clones. 

Summary of the invention 

According to one aspect of the present invention, 
there is provided a method for generating a polynucleotide 
sequence or population of sequences from a parent 
polynucleotide sequence encoding one or more protein 
motifs, comprising the steps of 

a) digesting the parent polynucleotide sequence with 
an exonuclease to generate a population of fragments; 

b) contacting said fragments with a template 
polynucleotide sequence under annealing conditions; 

c) amplifying the fragments that anneal to one 
template in step b) to generate at least one polynucleotide 
seq^aence encoding one or more protein motifs having altered 
characteristics as compared to the one or more protein 
motifs encoded by said parent polynucleotide. 

The parent polynucleotide is preferably double- 
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Stranded and the method further comprises the step of 
generating single- stranded polynucleotide sequence from 
said double -stranded fragments prior to step b) , Further, 
the template polynucleotide is preferably the parent 
polynucleotide sequence or at least a polynucleotide 
sequence having sequence in common with the parent 
nucleotide sequence so that the fragments will hybridize 
with the template under annealing conditions. For example, 
if the parent polynucleotide is an antibody, the template 
may be a different antibody having constant domains or 
framework regions in common. 

Therefore, typically, there is provided a method of 
combining polynucleotide fragments to generate a 
polynucleotide sequence or population of sequences of 
desired characteristics, which method comprises the steps 
of : 

(a) digesting a linear parent double- stranded 
polynucleotide encoding one or more protein motifs with an 
exonuclease to generate a population of double stranded 
fragments of varying leiigLhti; 

(b) obtaining single-stranded polynucleotides from 
said double- stranded fragments; and 

(c) assembling a polynucleotide sequence from the 
sequences derived from step (b) . 

Preferably the method further comprises the step of 
(d) expressing the resulting protein encoded by the 
assembled polynucleotide sequence and screening the protein 
for desired characteristics. 

Prior to assembling the polynucleotide sequence in 
step (c) the double stranded sequences are preferably 
purified and then mixed in order to facilitate assembly. By 
controlling the reaction time of the exonuclease the size 
of the polynucleotide fragments may be determined. 
Determining the lengths of the polynucleotide fragments in 
this way avoids the necessity of having to provide a 
further step such as purifying the fragments of desired 
length from a gel . 
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Furt-her, as some exonuciease digests polynucleotide 
sequences from bcnh the 3' and the 5' ends, the fragm.ents 
selected may center around the middle of the gene sequence 
if this particular region of sequence is desired. This 
seq^uence from the m.iddle of a gene may be mutated random.ly 
by, for example, error prone PCR and desirable for the 
shuf f 1 ing process . 

However, in some cases it may be desirable not to 
shuffle the sequence from the middle of the gene. This may 
be prevented by choosing long fragments after exonuciease 
treatment. Conversely, if it is desirable to shuffle the 
middle of the gene sequence short exonuciease treated 
fragments may be used. 

In order to generate a polynucleotide sequence of 
desired characteristics the parent double - stranded 
polynucleotide encoding one or more protein motifs may be 
subjected to mutagenesis to create a plurality of 
differently mutated derivatives thereof. Likewise, a 
parent double-stranded polynucleotide may be obtained 
already encoding a plurality of variant protein motifs of 
unknown sequence . 

Random mutation can be accomplished by any 
conventional method as described above, but a suitable 
method is error-prone PCR. 

It is preferable to use PCR technology to assemble the 
single- stranded polynucleotide fragments into the double- 
stranded polynucleotide sequence. 

The pol^Tiucleot ide sequence is preferably DNA although 
RNA may be used . For simplicity the term polynucleotide 
will novj be used in the following text in relation to DNA 
but It w^ill be appreciated that the present invention is 
applicable to both RNA and DNA. 

Any exonuciease that digests polynucleotide from the 
3' prim.e end to the 5' prime end or fromt both the 2' and 
the 5' end may be used. Examples of a suitable exonuciease 
which m^ay be used m accordance v;ith the present invention 
include 3AL31 and Exonuciease III. 
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BAL31 is a exonuclease that digests and removes 
nucleotide bases from both the 3' and the 5' ends of a 
linear polynucleotide molecule. The enzyme uses Ca2+ as a 
CO- factor which can be bound in complex with EGTA (Ethylene 
Glycol bis (/3-amino ethyl Ether) N, N, N' , N' - tetra acetic 
acid) . EGTA does not bind Mg2+ which is necessary for the 
subsequent PGR process. Linear DNA sequences are digested 
with BAL31 and the reaction stopped at different time 
points by the addition of EGTA. The individual digested 
fragments are purified, mixed and reassembled with PGR 
technology. The assembled (reconstituted) gene may then be 
cloned into an expression vector for expressing the 
protein. The protein may then be analyzed for improved 
characteristics . 

The PGR technique uses a template, which may be the 
wild type sequence or a reconstituted sequence in 
accordance with the present invention. The fragments 
hybridize with the template at the appropriate regions 
(i.e. where the homology between the two strands is at its 
highest) and the remaining sequence xs genexaLed by 
elongation of the fragment using the template in accordance 
with the PGR technique. 

The method of the present invention provides several 
advantages over known shuffling techniques. For example, in 
other DNA shuffling techniques the process itself 
introduces mutations over the entire gene sequence. The 
present invention allows for the concentration of mutations 
on i) the flanking regions after recombination of wild type 
fragments, on either an already recombined template created 
by the method of the present invention, a template mutated 
in any other way or a gene (or gene combination, for 
example, a combination of antibody genes) having a desired 
sequence; or ii) the middle region after recombination of 
mutated fragments created by the method of the present 
invention on a wild type template. 

In other words, if it is desirable to provide a gene 
having mutations concentrated in its flanking regions, a 
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wild type fragment relating ro the middle region of rhe 
gene may be used in conjunction wioh a reconstituted and/ or 
mutated template sequence for the PGR process. In this 
way, the PGR process generates complementary sequence to 
the reconst ituted/mutated template sequence as it elongates 
the wild type fragment. Therefore, the resulting sequence 
will have substantially a middle region corresponding to 
the wild type sequence and flanking regions with 
incorporated mutations . 

Conversely, if it is desirable to provide a gene 
having mutations concentrated in its middle region, a 
reconstituted and or mutated fragment corresponding to the 
m.iddle region of the gene may be used in conjunction with 
a wild type template in the PGR process. In this way, the 
PGR process, by elongating the mutated fragment using the 
wild type template, generates a sequence having 
substantially a mutated middle region and wild type 
flanking regions. 

Further, the method of the present invention produces 
a set of progressively shortened DNA fragments for each 
time point a DNA sample is taken from the BAL31 treatment. 
The DNA samples may be collected and pooled together or, 
optionally, individual samples may be chosen and used in 
the method- Thus the present invention allows a selection 
of what DNA samples are to be used in the recombination 
system and thereby offers a further degree of control . 

The method of the present invention may be carried out 
on any polynucleotide which codes for a particular product 
for example any protein having binding or catalytical 
properties e.g. antibodies or parts of antibodies, enzymes 
or receptors. Further, any polynucleotide that has a 
function that may be altered for example catalytical PJNIA 
may be shuffled m accordance with the present invention. 
It is preferable that the parent polynucleotide encoding 
one c-r more protein m.otif is at least 12 nucleotides in 
length, more preferably at least 20 nucleotides in length, 
even more preferably more than 5 0 nucleotides in length. 
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Polynucleotides being at least 100 nucleotides in length or 
even at least 200 nucleotides in length may be used. Where 
parent polynucleotides are used that encoded for large 
proteins such as enzymes or antibodies, these may be many 
hundreds or thousands of bases in length. The present 
invention may be carried out on any size of parent 
polynucleotide , 

The present invention also provides polynucleotide 
sequences generated by the method described above having 
desired characteristics. These sequences may be used for 
generating gene therapy vectors and replication-defective 
gene therapy constructs or vaccination vectors for DNA- 
based vaccinations. Further, the polynucleotide sequences 
may be used as research tools. 

The present invention also provides a polynucleotide 
library of sequences generated by the method described 
above from which a polynucleotide may be selected which 
encodes a protein having the desired characteristics. It is 
preferable that the polynucleotide library is a DNA or cDNA 



The present inventions also provides proteins such as 
antibodies, enzymes, and receptors having characteristics 
different to that of the wild type produced by the method 
described above. These proteins may be used individually or 
within a pharmaceut ically acceptable carrier as vaccines or 
medicaments for therapy, for example, as immunogens , 
antigens or otherwise in obtaining specific antibodies. 
They may also be used as research tools. 

The. desired characteristics of a polynucleotide 
generated by the present invention or a protein encoded by 
a polynucleotide generated by the present invention may be 
any variation in the normal activity of the wild type 
(parent) polynucleotide or the polypeptide, protein or 
protein motifs it encodes. For example, it may be 
desirable to reduce or increase the catalytic activity of 
an enzyme, or improve or reduce the binding specificity of 
an antibody. Further, if the protein, or polynucleotide is 
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an immunogen, it: may be desirable to reduce cr increase its 
ability to obtaiLn specific antibodies against it. The 
oarent oolynucleot ide preferably encodes one or more 
protein motifs. These are defined by regions of 
5 polynucleotide sequence that encode pcl\^eptide sequence 

having or potentially having characteristic protein 
function. For example, a protein motif miay define a portion 
of a whole protein, i.e. an epitope or a cleavage site or 
a catalytic site etc. However, within the scope of the 
10 present invention, an expressed protein motif does not have 

to display activity, or be "correctly" folded. 

It mtay be desirable to modify a protein so as to alter 
the conf ormiation of certain epitopes, thereby improving its 
antigenicity and/or reducing cross - react ivity . For example, 
15 should such a protein be used as an antigen, the 

modification may reduce any cross - react ion of raised 
antibodies with similar proteins. 

Although the term "enzyme" is used, this is to 
interpreted as also including any polypeptide having enzyme 
20 -like activity, i.e. a catalytic function. For example, 

polypeptides being part of an enzyme may still possess 
catalytic function. Likewise, the term "antibody" should be 
construed as covering any binding substance having a 
binding domain with the required specificity. This includes 
25 antibody fragments, derivatives, functional equivalents and 

homologues of antibodies, including synthetic molecules and 
molecules whose shape mimics that of an antibody enabling 
it to bind an antigen or epitope. Examples of antibody 
fragments-, capable of binding an antigen or other binding 
3 0 Dartner are Fab fragment consisting of the VL , VH , Cl and 

CHI domains , the Fd fragment consisting of the VH and CHI 
domains; the Fv fragment consisting of the VL and \rH 
domains of a single arm. cf an antibody; the dAb fragment 
which consists of a 'v^ri domain; isolated CDR regions and 
25 F(ab')2 fragments, a bivalent fragment including tv;o Fab 

fragments linked by a disulphide bridge at the hinge 
reaicn. Smale chain Fv fragments are also included. 
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In order to obtain expression of the generated 
polynucleotide sequence, the sequence may be incorporated 
in a vector having control sequences operably linked to the 
polynucleotide sequence to control its expression. The 
5 vectors may include other sequences such as promoters or 

enhancers to drive the expression of the inserted 
polynucleotide sequence, further polynucleotide sequences 
so that the protein encoded for by the polynucleotide is 
produced as a fusion and/or nucleic acid encoding secretion 
10 signals so that the protein produced in the host cell is 

secreted from the cell . The protein encoded for by the 
polynucleotide sequence can then be obtained by 
transforming the vectors into host cells in which the 
vector is functional, culturing the host cells so that the 
15 protein is produced and recovering the protein from the 

host cells or the surrounding medium. Prokaryotic and 
eukaryotic cells are used for this purpose in the art, 
including strains of E. coli, yeast, and eukaryotic cells 
such as COS or CHO cells. The choice of host cell can be 
20 used to control the properties of the protein expressed in 

those cells, e.g. controlling where the protein is 
deposited in the host cells or affecting properties such as 
its glycosylat ion . 

The protein encoded by the polynucleotide sequence may 
25 be expressed by methods well known in the art. 

Conveniently, expression may be achieved by growing a host 
cell in culture, containing such a vector, under 
appropriate conditions which cause or allow expression of 
the protein . 

30 Systems for cloning and expression of a protein in a 

variety of different host cells are well known. Suitable 
host cells include bacteria, eukaryotic cells such as 
mammalian and yeast, and baculovirus systems. Mammalian 
cell lines available m the art for expression of a 

35 heterologous polypeptide include Chinese hamster ovary 

cells, HeLa cells, baby hamster kidney cells, COS cells and 
many others. A common, preferred bacterial host is E. 
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Su:Lt:able vecnors can be chosen or consrrucced, 
contamiLng approprxace regulacory sequences, :inciuding 
Dromouer sequences, cerminanor fragments, polyadenylat ron 
5 sequences, enhancer sequences, riarker genes and other 

secuences as appropriane. Veccors may be plasmids, viral 
e.g. 'phage, or phagemid, as appropriate. For further 
details see, for example, Molecular Cloning: a Laboratory 
Manual: 2nd edition, Sambrook et al . , 1989, Cold Spring 
10 Harbor Laboratory Press. Many known techniques and 

protocols for manipulation of polynucleotide sequences, for 
examole m preparation of polynucleotide constructs, 
mutagenesis, sequencing, introduction of DNA into cells and 
gene expression, and analysis of proteins, are described in 
15 detail in Current Protocols in Molecular Biology, Ausubel 

et al . eds . , John Wiley ^ Sons, 1992. 

The FIND system can be used for the creation of DNA 
libraries comprising variable sequences which can be 
screened for the desired protein function in a number of 
20 ways. Phage display may be used for selecting binding 

{Griffith et al . , EMBO J. 113: 3245-3260, 1994); screening 
for enzyme function {Crameri A. et al , Nature 1998 15; 391 
(6664) :28S-291; Zhang J. K. et al , PNAS . USA 1997 94 (9): 
4504-4509; Warren M.S. et al , Biochemistry 1996, 9; 35(27): 
25 8855-8862) . 

A protein provided by the present invention may be 
used m screening for molecules which affect or modulate 
its activity or function. Such molecules may be useful in 
a therapeutic (possibly including prophylactic) context. 
3Q The Dresent invention also provides vectors comprising 

pol\Tiucleot ide sequences generated by the method described 
above . 

The oresent inventions also provides compositions 
comtjrismc; either polynucleotide sequences, vectors 
35 comprising the polynucleotide sequences or proteins 

aenerated by the m.ethod described above and a 
pharmaceutically acceptable carrier or a carrier suitable 
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for research purposes. 

The present invention also provides a method 
comprising, following the identification of the 
polynucleotide or polypeptide having desired 

5 characteristics by the method described above, the 

manufacture of that polypeptide or polynucleotide in whole 
or in part, optionally in conjunction with additional 
polypeptides or polynucleotides. 

Following the identification of a polynucleotide or 
10 polypeptide having desired characteristics, these can then 

be manufactured to provide greater numbers by well known 
techniques such as PGR, cloning a expression within a host 
cell. The resulting polypeptides or polynucleotides may be 
used in the preparation of medicaments for diagnostic use, 
15 pharmaceutical use, therapy etc. This is discussed further 

below. Alternatively, the manufactured polynucleotide, 
polypeptide may be used as a research tool, i.e. antibodies 
may be used in immunoassays, polynucleotides may be used a 
hybridization probes or primers. 
20 The po]ypeptides or polynucleotides generated by the 

method of the invention and identified as having desirable 
characteristics can be formulated in pharmaceutical 
compositions. These compositions may coiriprise, in addition 
to one of the above substances, a pharm.aceut ically 
25 acceptable excipient, carrier, buffer, stabilizer or other 

materials well known to those skilled in the art. Such 
materials should be non- toxic and should not interfere with 
the efficacy of the active ingredient. The precise nature 
of the carrier or other material may depend on the route of 
3 0 administration, e.g. oral, intravenous, cutaneous or 

subcutaneous, nasal, intramuscular, intraperitoneal routes. 

Pharmaceutical compositions for oral administration 
may be in tablet, capsule, powder or liquid form. A tablet 
may include a solid carrier such as gelatin or an adjuvant. 
35 Liquid pharmaceutical compositions generally include a 

liquid carrier such as water, perroleum, animal or 
vegetable oils, mineral oil or synthetic oil. 
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Physiological saline solution, dextrose or other saccharide 
solution or glycols such as eiihylene glycol, propylene 
glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, 
or injection at the site of affliction, the acrive 
ingredient will be in the form of a parenterally acceptable 
aqueous solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the 
art are well able to prepare suitable solutions using, for 
example, isotonic vehicles such as Sodium Chloride 
Injection, Ringer's Injection, Lactated Ringer's Injection, 
Preservatives, stabilizers, buffers, antioxidants and/or 
other additives may be included, as required. 

Whether it is a polypeptide, e.g. an antibody or 
fragment thereof, an enzyme, a polynucleotide or nucleic 
acid molecule, identified following generation by the 
present invention that is to be given to an individual, 
administration is preferably in a "prophylact ically 
effective amount" or a "therapeutically effective amount" 
(as the case may be, although prophylaxis may be considered 
therapy) , this being sufficient to show benefit to the 
individual. The actual amount administered, and rate and 
time-course of adm.inistrat ion , will depend on the nature 
and severity of what is being treated. Prescription of 
treatment, e.g. decisions on dosage etc, is within the 
responsibility of general practitioners and other medical 
doctors, and typically takes account of the disorder to be 
treated, the condition of the individual patient, the site 
of delivery, the method of administration and other factors 
known to practitioners. Examples of the techniques and 
protocols mentioned above can be found in Remington's 
Pharmaceutical Sciences, 16th edition, Osol , A. (ed), 1980. 

Alternatively, targeting therapies miay be used to 
deliver the active agent more specifically to certain t\^es 
of cell, by the use of targeting systems such as antibody 
or cell specific ligands. Targeting may be desirable for 
a variety of reasons; for example if the agent is 
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unacceptably toxic, or if it would otherwise require too 
high a dosage, or if it would nor otherwise be able to 
enter the target cells. 

Instead of administering these agents directly, they 
could be produced in the target cells by expression from an 
encoding gene introduced into the cells, e.g. in a viral 
vector (a variant of the VDEPT technique) . The vector 
could be targeted to the specific cells to be treated, or 
it could contain regulatory elements which are switched on 
more or less selectively by the target cells. 

Alternatively, the agent could be administered in a 
precursor form, for conversion to the active form by an 
activating agent produced in, or targeted to, the cells to 
be treated. This type of approach is sometimes known as 
ADEPT or VDEPT; the former involving targeting the 
activating agent to the cells by conjugation to a cell- 
specific antibody, while the latter involves producing the 
activating agent, e.g. an enzyme, in a vector by expression 
from encoding DNA in a viral vector (see for example, EP-A- 
415711 and TaIO 90/07936) . 

A composition may be administered alone or in 
combination with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated. 

As a further alternative, the polynucleotide 
identified as having desirable characteristics following 
generation by the method of the present invention could be 
used in a method of gene therapy, to treat a patient who is 
unable to synthesize the active polypeptide encoded by the 
polynucleotide or unable to synthesize it at the normal 
level, thereby providing the effect provided by the 
corresponding wild-type protein. 

Vectors such as viral vectors have been used in the 
prior art to introduce polynucleotides into a wide variety 
of different target cells. Typically the vectors are 
exposed to the target cells so that transfection can take 
place in a sufficient proportion of the cells to provide a 
useful therapeutic or prophylactic effect from the 
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exDression of the desired polypeptide. The transfec 
nucleic acid may be periTLanent ly incorporated inoo the 
aenome of each of the targeted tumour cells, providing long 
lasting effect, or alternatively the treatment may have to 
be repeated periodically. 

A variety of vectors, both viral vectors and plasmiid 
vectors, are known in the art, see US Patent No. 5,252,4 79 
and WO 93/07282. In particular, a number of viruses have 
been used as gene transfer vectors, including 
papovaviruses, such as SV4 0, vaccinia virus, herpes 
viruses, including HSV and EBV, and retroviruses. Many 
gene therapy protocols in the prior art have used disabled 
murine retroviruses . 

As an alternative to the use of viral vectors ether 
known methods of introducing nucleic acid into cells 
includes electroporation, calcium phosphate co- 
precipitation, mechanical techniques such as 
microinjection, transfer mediated by liposomes and direct 
DNA uptake and receptor-mediated DNA transfer. 

As mentioned above, the aim of gene therapy using 
nucleic acid encoding a polypeptide, or an active portion 
thereof, is to increase the amount of the expression 
product of the nucleic acid in cells in which the level of 
the wild-type polypeptide is absent or present only at 
reduced levels. Such treatment may be therapeutic in the 
treatment of cells which are already cancerous or 
prophylactic in the treatment of individuals known through 
screening to have a susceptibility allele and hence a 
predisposition to, for example, cancer. 

The oresent invention also provides a kit for 
Qenerating a polynucleotide sequence or population of 
sequences of desired characteristics comprising an 
exonuclease and components for carrying out a PGR 
technique, for example, thermostable DNA ^ nucleot ides ) and 
a stopping device, for example, EGTA . 

The oresent applicants have termed the technology 
described above as FIND (Fragment Induced Nucleotide 
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Diversity) . 

As outlined above, the FIND programme, in accordance 
with the present invention conveniently provides for the 
creation of mutated antibody gene sequences and their 
random combination to functional antibodies having 
desirable characteristics. As an example of this aspect of 
the invention, the antibody genes are mutated by error 
prone PGR which results in a mutation rate of approximately 
0.7%. The resulting pool of mutated antibody genes are then 
digested with an exonuclease, preferably BAL31, and the 
reaction inhibited by the addition of EGTA at different 
time points, resulting in a set of DNA fragments of 
different sizes. These may then be subjected to PGR based 
reassembly as described above. The resulting reassembled 
DNA fragments are then cloned and a gene library 
constructed. Clones may then be selected from this library 
and sequenced. 

A further application of the FIND technology is the 
generation of a population of variable DNA sequences which 
can be used for further selections a-nd analyses. Besides 
encoding larger proteins, e.g. antibody fragments and 
enzymes, the DNA may encode peptides where the molecules 
functional characteristics can be used for the design of 
different selection systems. Selection of recombined DNA 
sequences encoding peptides has previously been described 
(Fisch et al PNAS . USA 1996 Jul 23; 93 (15) : 7761-7766) . In 
addition, the variable DNA population can be used to 
produce a population of RNA molecules with e.g. catalytic 
activities. Vaish et al (PNAS. USA 1998 Mar 3; 95 (5): 
2158-2162) demonstrated the design of functional systems 
for the selection of catalytic RNA and Eckstein F (Ciba 
Found. Symp . 1997; 209; 207-212) has outlined the 
applications of caiialytic RNA by the specific introduction 
of catalytic RNA in cells. The FIND system may be used to 
further search through the sequence space in the selection 
of functional pept ides /molecules with catalytic activicies 
based on recombined DNA sequences. 
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Aspects and embodiments of the present z.nvent^on will 
nov; be illustrated, by v;ay of example, v;ioh reference to 
the accompanying figures. Further aspects and emibodiments 
will be apparent to those skilled in the art. All documents 
mentioned in this text are incorporated nerein by 
re f erence . 

Brief description of the drawinQS 

Figirre 1 shows the principle steps in the shuffling of 
specific DNxA sequences between different clones; 

Figure 2 shows the principle steps m the PGR 
elongation of exonuclease treated gene sequences; 

Figure 3 shows the principle steps m the PGR 
elongation of long f ragm.ents of exonuclease treated gene 
sequences. The use of long fragments results m the middle 
region of the gene not being recombined. This region may 
however contain random mutations and the middle of the gene 
sequence may thus differ form other clones. The middle 
region of the sequence may differ in length, but by using 
longer primers the middle region may be covered; 

Figure 4 shows the principle steps m the PCR 
elongation of short fragments of exonuclease treated gene 
secaiences. The use of short fragments results in the middle 
region of the gene being recombined. If a longer reaction 
time is used for the exonuclease digestion a set of 
fragments of differing lengths are produced. If the 
fragments are short, some fragments will be located away 
from the middle region of the gene sequence thereby 
allowing recombination of the miiddle sequence; 

Figure 5 shows the appearance of DNA at different 
fixed time intervals after digestion w^iuh BAL31 Nuclease. 
The DNA was mixed with the enz\mae and incubated at 3 0*^0, At 
different time points samples v;ere removed and the 
enz\mnatic activity stopped oy addition of 2 Omr< EGTA . Tne 
samples from the different time points were purified and 
analyzed on a 2% agarose gel. The samples are indicated as 
follows: 1Kb = DNA molecular marker 1; 2 - 10m = 2 to 10 
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minutes BAL31 incubation samples; 

Figure 6 shows A) the theoretical insert after 
restriction digestion of the fragment resulting from the 
primer combination FIND 1, pBR322 Nhel -forward -STOP - 
primer with pBR322 -EagI -reversed-primer . This is termed 
FIND 1 and SEQ ID #5; and B) the theoretical insert after 
restriction digestion of the fragment resulting from the 
primer combination pBR322 Hindlll forward primer and pBR322 
Sail reverse stop primer. This is termed FIND 3 and SEQ ID 
#6; and 

Figure 7 shows the experimentally determined sequences 
of the 2 first FIND clones after automated sequencing. A) 
shows FIND 1 sequence with the STOP codon marked in bold 
(SEQ ID #7) ; and B) shows the FIND 3 sequence with the STOP 
codon shown in underline text (SEQ ID #8) . 

Figure 8 shows the sequence of pEXmide V (4055bp) 
Ncol- and Sal I- sites are marked in underlined text (SEQ 
ID #9) . 

Dp^t ailed descri pt ion and exemplification of the invention 
One aspect of the DNA shuffling procedure can be 
illustrated by the steps shown in Figure 1. The gene 
encoding the tetracycline-resistance (Tet-P.) in the plasmid 
pBR322 is used in this example. Two clones were generated 
by site directed mutagenesis: one with an engineered stop 
codon close to the 5' terminus and one with a stop codon 
close to the 3' terminus of the Tet-R gene. The phenotype 
of these two genes is tetracycline sensitive. By mixing the 
two clones in equimolar amounts and digesting with BAL31 
revertants were selected. After cloning the reassembled 
genes (with combination between the two genes carrying the 
two stop codons) revertants with a frequency of 16% were 
detected, i.e. 16% of the clones were tetracycline 
resistant. The experiment used the ampicillin-resistance in 
pBR322 for primary selection and then individual Amp-R 
clones were tested under tetracycline selection (see the 
overview in Fig. 1 and the theoretical view in Fig. 2) . 
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A more detia^led description of examples of the present 
invention are aiven belov.^. 
Reagents : 

AmpliTaq^ pol\ar.erase was purchased from Perkm-Elmer 
Corp., dNTPs from Boehringer Mannheim Biochemica (Mannheim, 
Germ.any) , and BAL3 1 Nuclease from New England Biolabs Inc. 
(Beverly, USA) . Klenow enzyme was purchased from Amersham. 
All restriction enz\'Tnes were purchased from Boehringer 
Mannheim Biochemica (Mannheim, Germany) . Ethidium bromide 
was purchased from Bio-Rad Laboratories (Bio-Rad 
Laboratories, Hercules, CA, USA), T4 DNA Ligase was 
purchased from Appligene Inc. (Pleasanton, CA, USA). 

All primers were designed in the laboratory and 
s\Tithesized with an Applied Biosystems 3 91 DNA- synthes i ser . 



PGR: 

All Polymerase Chain Reactions (PCR) were carried out 
in a automatic thermocycler (Perkin-Elmer Cetus 480, 
Norwalk, CT,USA). PCR techniques for the amplification of 
nucleic acid are described in US Patent No. 4,683,195. The 
PCR reactions were run at varying amounts of cycles 
consisting of following profile: denaturation (94°C, 1 
minute) , primer annealing (55°C, 1 minute) and extension 
(72°C, 1 minute) using a 1 second ramp time. The PCR 
reactions contained, unless otherwise noted, 5/il of each 
primer (20^M), 8^1 of dNTP (1.25mM each of dTTP , dATP , dCT? 
and dOT?) , 10^1 lOx reaction buffer, O.Sfil AmpliTaq^ 
thermostable DNA pol\mrierase (BU/mD (Perkin-Elmer Corp.), 
and water to a final volum.e of lOO^il. In all PCR 
experiments these parameters were used and the numiber of 
reaction cycles was varied. Preferences for the general use 
of PCR techniaues include Mullis et al , Cold Spring Harbor 
S\nT.o . Quant. Biol., 51:263, (1987), Ehrlich (ed), PCR 
technology, Stockton Press, NY, 1985, Ehrlich et al , 
Bc2.^nce, 2 52:1643 -1650, (1991), "PCR protocols; A Guide to 
Methods and Applications", Eds. Innis et al , Academic 
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Sequencing : 

All construcrs have been sequenced by the use of a Taq 
Dyedeoxy"" Terminator Cycle Sequencing Kit. The sequencing 
was performed on a ABI Prism 3 73 DNA Sequencer. 

Agarose electrophoresis : 

Agarose electrophoresis of DNA was performed with 2% 
agarose gels composed of 1% NuSieve® GTG® Low Melting 
AGAROSE (FMC Bioproducts, Rockland, ME, USA) and 1% 
AMRESCO® Agarose (AMRESCO, SOLON, OH, USA) with 0.25Mg/ml 
ethidium bromide in Tris-acetate buffer (TAE-buffer 0 . 04M 
Tris-acetate , O.OOIM EDTA) . Samples for electrophoresis 
were mixed with a sterile filtrated loading buffer composed 
of 25% Ficoll and Bromphenolic blue and loaded into wells 
in a the 2% agarose gel. The electrophoresis was run at 90 
V for 45 minutes unless otherwise stated in Tris-acetate 
buffer with 0.25/ig/ml ethidium bromide. Bands of 
appropriate size were gel-purified using the Qiaquick Gel 
Extraction Kit (Qiagen GmbH, Hilden, Germany) . As molecular 
weight standard, DNA molecular weight marker 1 (Boehringer 
Mannheim GmbH, Germany) was used. The DNA-concentrat ion of 
the gel extracted products were estimated using a 
spectrophotometer {see Fig, 5) , 

Bacterial Strains : 

The Escherichia, coli-strain E,coli BMH71-18 (supE thi 
A(Iac-proAB) F' [proAB' lacl"^ A ( lacZ) MIS ] ) , was used as a 
bacterial host for transformations. Chemically competent 
cells of this strain were produced basically as described 
Hanahan, D. 1983. Srudies on transformation of Escherichia 
coli with plasmids. J. Mol . Biol. 166: 557-580. 
Electrocompetent cells of this bacterial strain were 
produced (Dower, W.J., J. F. Miller, and C.W. Ragsdale . 
1988: High efficiency transformation of E.coli by high 
voltage elect roporat ion . Nucleic Acids Res. 16:6127) . 
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Plasmids : 

The cerracycl une resiStance-gene of pBR322 is 1151 bp 
(basepairs) long. A deleted tetracycline resistance -gene 
variant of plasmid pBR.322 was constructed by cleavzLng the 
plasmid with the restriction enzymes Sail and 3amHI , This 
resulted in removal of a 276 bp fragment inside the 
tetracycline gene. A cleavage reaction w^ith Hindlll and 
EagI and the deleted plasmid would theoretically lead to a 
634 bp cleavage-product, whereas a wildtype pBR322 cleaved 
with these enzymes produces a 910 bp product. The resulting 
protr^ading single stranded overhangs on the deleted plasmid 
after cleavage were treated w^ith Klenow enzyme to generate 
double- stranded ends at both ends of the plasmid. These 
ends were then blunt-end ligated according to Molecular 
cloning; A LABORATORY MAl^IUAL (Second Edition, Cold Spring 
Harbor Laboratory Press, 1989). The resulting plasmid was 
transformed into chemically competent E,coli BMH71-18 and 
plated onto ampillicin-containing plates (100 ^g/ml). When 
replated onto tetracycline- containing agarplates (10 ^g/ml) 
the colc-nies were tetracycline sensitive. 

External primers : 

Two external primers surrounding the tetracycline gene 
of pBR322 were designed with the following sequences 
including designated unique restriction sites: 

pBR322 Hindlll forward primer: 

5 ' -CAGCTTATCATCGAT7^.GCTTTAATGCGGTAGTTTAT-3 ' (SEQ ID #1) 
and pBR3 2 2 - EagI - reversed - primer : 

5 ' -CGTAGCCCAGCGCGTCGGCCGCCATGCCGGCGATAATG-3 ' (SEQ lO #2) 

To show that the two external primers covers the 
functional parts of the t etracycl ine -gene , a PGR reaction 
with the above mentioned profile was used for a 3 0 cycles 
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PGR with pBR322 (250 ng ) as a template and the external 
primers described above. This yielded a PCR-product of 910 
bp after subsequent cleavage with Hindlll and EagI . When 
this restriction product was cloned in a likewise 
5 restriction-digested pBR322 plasmid, the plasmid encoded a 

tetracycline resistant phenotype . This was detected after 
transformation of a ligation of plasmid and 910 bp PCR- 
product into E.coli host BMH 7118 plated on tetracycline 
containing agar-plates (10 /ig/ml) . 

10 

STOP-containing primers: 

Two pBR322 forward mutagenic primers and two pBR3 22 
reversed primers containing unique restrict ion- si tes and 
one STOP codon each at various sites were constructed. 
15 These were : 

pBR322 Nhel forward STOP: 
5 ' - 

CACTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGAGCACCCGTT 
20 CT -3' . (SEO ID #3) 

pBR3 2 2 Sail reversed STOP: 

5 ' - 

TCTCAAGGGCATCGGTCGACGCTCTCCCTTATGCGACTCCTGCATTAGGAATCAGCC 
2 5 CAGTAGTA -3' (SEQ ID #4) 

Generation of STOP-codon containing variants of pBR322 
plasmids . 

Four different variants of the tetracycline gene were 
30 constructed. A combination of one mutated forward or 

■ reversed primer with the corresponding external forward or 
reversed primer was used in PGR- react ions to generate 
mutated inserts. Plasmid pBR322 was used as a template (250 
ng) m 40 PCR-cycles. The resulting restriction digested 
35 fragments were then cloned into tetracycline deleted 

pBR322, and the resulting clones were called FIND 1 and 
FIND 3 
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The follov;ing primer combinations were used: 
FIND I, ■03R322 Nhel - f orv;ard- STO?-prxmer with p3R322-EagI- 
reversed-primer . This combination gave the insert after 
restriction digestion as shown m Figure 6A; and FIND 3, 
5 p3R322 Hindlll forward primer and p3R322 Sail reversed STOP 

primer. This combination gave the insert after restriction 
digestion as shown m Figure 6B . 

The amplified PCR-products were analysed on a 2% 
agarose gel. The electrophoresis was run at 90V for 40 
10 minutes as described above. Bands of appropriate size 

(lOOObo) , as compared to the molecular weight standard, 
were cut out and gel-purified using the Qiaquick Gel 
Extraction Kit. The four different STOP - containing inserts 
v;ere then cleaved with the restriction-enzymes designated 
]_5 in the primers above. For each insert a pool of plasmid 

pBR3 2 2 was cleaved with the same enzymes , and these four 
combinations were then ligated and transformed into 
chemically competent E coli BMH 71-18 according to the 
modified protocol of Detlef (Modified Hanahan, revised M. 
20 Scott, F. Hochstenbach and D. Giissow 1989). The 

transf ormants were plated onto ampicillin containing agar- 
plates (50 ^g/ml) . When replated on tetracycline containing 
agar-plates (10 yig^/TV.\) no colonies survived, confirming the 
functional effect of the introduced STOP-codon m the 
25 tetracycline-gene . Plasmids of the four different FIND- 

clones were prepared with Qiagen Plasmid Midi Kit (Qiagen 
Inc., Chatsworth, CA, USA). The plasmids of the four clones 
were sequenced by the use of a Taq Dyedeoxy" Terminator 
Cycle Sequencing ?:it . The sequencing was performed on a ABI 
30 Prism 373 DNA Sequencer. The STOP-codons were confirmed and 

the inserts to be correct . 



FIND EXPERIMFnsrr I : 

Generation of FIND- fragments for BAL31 Nuclease digestion. 

PGR- fragment of PINT) 1 and FIND 3 v;ere generated by 
PCP^- react ions v;ith FIND 1 and FIND 3~plasm.ids as 
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templates (500 ng) and with the two external primers, 
pBR322 Hindlll forward pro^mer and pBR322 -EagI -reversed- 
primer. PCR-cycles were as described above for 30 cycles. 
The amplified PCR-products were mixed with 20 /il of loading 
buffer (25% Ficoll and Eromphenolic blue) and analysed on 
a 2 % agarose gel . The electrophoresis was run at 90V for 
3 5 minutes as previously described. Bands of appropriate 
size were cut out and gel-purified using the Qiaquick Gel 
Extraction Kit. The DNA- concentration was estimated to 
112.25 /ig/ml for the FIND-1 PGR- fragment and to 110 /ig/ml 
for the FIND- 3 PGR- fragment . 

BAIi31 Nuclease treatment: 

5 fig each of FIND 1 and FIND 3 PGR- f ragm.ent s (Fig. 7 
A and B) were mixed in equimolar amounts together with 
lOO/il of 2x BAL31 buffer and 10/xl sterile water to a final 
volume of 200/il. A smaller volume of 22.5^x1 was prepared to 
be used as an enzymatically untreated blank. This consisted 
of 4.5ptl FIND 1-fragment and 4.5/^1 of FIND 3, 11.25/il 2x 
BALi31 nuclease buffer and 2.25/xl sterile water. 1.5ml 
sterile eppendorf tubes with DNA and 2x BAL31 nuclease 
buffer and water as described were pre- incubated in a 30°C 
water-bath in a cold-room of -f4°G for 10 minutes. Meanvjhile 
five sterile eppendorf tubes were prepared with 4/il each of 
a 200mM solution of EGTA. These were marked 1-9 minutes. In 
the same way a tube with 2 . 5//1 200 mM EGTA was prepared for 
the blank untreated DNA-solution. The working concentration 
of EGTA is 20mM. After the 10 minutes pre - incubation BAL31 
Nuclease was added to the tube with the larger volume to a 
final concentration of 1 Unit//ig of DNA (lO^il of 1 U/fil 
solution) . After t= 1, 3, 5, 7 and 9 minutes the tube was 
mixed and samples of 3 6/il was removed and added to the 
tubes wirh 4ul of EGTA and placed onto ice. At the same 
time the blank volume of 22 . 5/xl was removed and added to 
the prepared 2.5^1 of EGTA and also placed on ice. The 
tubes were then placed in a 65 °C water-bath for heat 
inactavation of the enzyme and then replaced onto ice. 
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Purification of digestion produced fragments: 

The volumes in the cubes were correcred to lOQ/il each 
and a phenol/chlorcf orm/isoamylalcohol exrracrion was 
performed. SOpil of buffered phenol was added zo each tube 
together with 50/il of a mixture of chloroform and 
isoamylalcohol (24:1). The tubes were vortexed for 30 
seconds and then centrifuged for 1 mxnute in a microfuge at 
14000 r.p.m. The upper phase was then collected and mixed 
with 2.5 volumes of 99.5% Ethanol (l/lO was 3M Sodium 
Acetate, pK 5.2) . The DNA was precipitated for 1 hour in - 
80°C. The DNA was then pelleted by cent ri fugat ion for 30 
minutes in a microfuge at: 14.000 r.p.m.. The pellet was 
washed once with 70% ethanol and then re-dissolved m 10^1 
of sterile v;ater. 

Analysis of digestion produced purified fragments on 
agarose gel : 

5fil of the dissolved pellet from each time point and 
from the blank were mixed with 2.5/il of loading buffer (2 5% 
Ficoll and Bromphenolic blue) and loaded into wells in a 2% 
agarose gel. The electrophoresis and subsequent gel 
extraction of the different timepoints were performed as 
above . 

Reassembly PGR with BALiBl Nuclease generated fragments: 

The remaining of the dissolved pellet from each 

time point after phenol -extract ion and precipitation were 
miixed m a PCR-reassemibly without primers. A portion of 5^1 
from, the untreated blank was added as template to make it 
possible to generate full lengrih fragm.ents. 4 0 PCR-cycles 
were run with the PCR-profile and reaction m.ixture as 
described above, but v;ithoui: any primers. 

PGR with external primers to increase the amount of 
reassembled PGR-products : 

50/il of the reassembled PCF.-product was miixed v;ith PGR 
reagents including zhe two external primers as described 
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above to generate a 100^1 PGR reaction. This PGR was run 
for 25 cycles with the profile described above. The 
amplified PCR-product was analysed on a agarose gel . A band 
of approximately 1000 bp was visible on the gel after the 
second PGR with the two external primers . The remaining 
50/il from the first reassembly PGR, showed only a smear of 
bands spanning the whole interval of the molecular weight 
marker. The 1000 -bp fragment after the second PGR was 
excised and gel-purified as described previously. 

Restriction digestion of reassembled FIND- fragment and 
tetracycline sensitive pBR322 with Hindlll and EagI : 

10 fig of tetracycline deleted pBR322 (lO/il) was cleaved 
with 2/il each of the enzymes Hindlll (lOU/^1) and EagI 
(10U//zl) (4U enzyme/^g vector) in a mixture with lO/il 
lOxbuffer B (supplied with the enzymes) and water to lOO/il. 
All of the agarose purified reassembled FIND- fragment was 
cleaved with the same enzymes in a similar 100 fj.! reaction 
mixture. The tubes were incubated in a 37 °C water bath for 
14 hours . 

Gel purification of restriction digested vector and 
restriction digested reassembled FIND-f ragment : 

The cleavage reactions were mixed were analysed on a 
2% agarose gel. The restriction digested tetracycline- 
deleted pBR322 showed a cleavage product of about 600 bp. 
This corresponds well with the expected size of 63 5 bp. The 
band of the cleaved plasmid was cut out and gel -extracted 
as previously described. The reassembled cleaved FIND- 
product was about 1000 bp long and was gel extracted in the 
same manner as the plasmid. 

Spectrophotometer estimations of the restriction 
digested-plasmid and FIND- fragment gave the following 
indications of DNA- concent rat ions : plasmid 13.5pig/ml; 
reassembled cleaved FIND- fragment 77.3^g/ml. 

Ligation of reassembled restriction digested FIND- fragment 
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with tetracycline deleted restriction digested pBR322 : 

3 . 6^g of liurificated cleaved cecracycl ineresisr:ance 
gene-deleted p3R322 was ligated to 2.76pig purified 
^reassembled restriction digested FIND- fragment at 12°C 
water bath for 16 hours. 50^1 of the vector was iTiixed with 
60^1 of the insert and 15^1 of lOx buffer (supplied with 
the enzyme) 7 . 5/il ligase (5 U/^l) and sterile water to a 
final volume of ISOfil, A ligation of 2/ig restriction 
digested tetracyclineresistance gene-deleted pBR322 without 
anv insert was also performed in the samie manner. 

Transformation of chemically competent E coli BMH 71-18 
with the ligated reassembled FIND-insert and pBR322 : 

The ligation reactions were purified by 
phenol/chloroform extraction as described above. The upper 
phase from the extraction was collected and mixed with 2.5 
volumes of 99.5% Ethanol (1/10 was 3M Sodium Acetate, pH 
5.2). The DNA was precipitated for 1 hour in -80 °C. The 
DNA was then pelleted by centrif ugation for 3 0 minutes in 
a microfuge at 14.000 r.p.m. The pellet was washed once 
with 70% ethanol and then re-dissolved m 10/xl of sterile 
water. 5^1 of each ligation was separately mixed with 95^1 
chemically competent E coli BMH 71-18 incubated on ice for 
1 hour and then transformed accordingly to the modified 
protocol of Detlef (Modified Hanahan, revised M. Scott, F. 
Hochstenbach and D. Giissow 1989). After one hour's growth 
the bacteria from the two transformations were spread onto 
ampicillin containing agar plates (100/ig/ml) . The plates 
were grown upside-down in a 37^C incubator for 14 hours. 

Testing of ampicillin-resistant transformant for 
tetracycline-resistant recombinants : 

The transf orrr.at ion v;ith reassembled FIND- fragment and 
tetracycline-deleted pBR322 gave 122 ampicillm-resistant 
transf ormants . The religated cleaved empty t etracycl ine - 
eyeleted pBR322 gave 100 trans f ormants . The transf ormants 
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one at a time to tetracycline (lO/ig/ml) containing agar 
plates and to ampicillin containing plates at the same time 
and to corresponding locations. These plates were incubated 
in 37^C incubator for 14 hours . 

Counting of tetracycline resistant recombinants: 

The colonies on both the tetracycline plates and the 
ampicillin plates were counted the following day for both 
transf ormants . 

FIND EXPERIMENT II: 

The above described methods were used for a second 
BAL31 Nuclease treatment with a mixture of Sfxg of FIND 1 
and 5/ig of FIND 3 as described above and in the overview in 
Fig, 1. This time new PGR- fragments had been generated with 
the estimated concentrations of 192.25ptg/ml for FIND 1 and 
2 3 1.5/ig/ml for FIND 3. The following reaction micture was 
used: 26/il FIND 1, 21.6/:il FIND 3, lOO/il 2x BAL.3 1 exonulease 
buffer, 9.9/il BAIi31 Nuclease and water to 200^1. A blank 
was also prepared with 13/il FIND 1 and 10.8/xl FIND 3, 36/il 
2x BAL31 exonulease buffer, Ojul BAL31 Nuclease and water to 
72/il . 

The BALi31 digestion v;as perform.ed as described in the 
previous experiment and samples were withdrawn at the same 
timepoints to tubes with 200mM EGTA to get a final 
concentration of 20mM EGTA. The exonuclease in the 
resulting samples was heat-inactivated as described above 
and the fragments where extracted, precipitated and 50% 
were loaded on agarose gel . After the same appearance as 
previously on the gel had been established, the samples 
were purified and 2 sequential PGR- reactions were run as 
before. The final PGR- fragment was cloned into tetracycline 
deleted pBR322 under the same conditions as above. The 
1 igat ion was then elect r operated into e lect rocompetent 
cells as described (Dower, W.J., J. F. Miller, and C . "W . 
Ragsdale . 1988: High efficiency transformation of E.coli by 
high voltage electroporat ion . Nucleic Acids Res. 16:6127.) 
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and plated on ampic:Lllin agar places as before. Several 
t-housands of transf orrnants were achieved. 3 97 of these were 
transported as described above to tetracycline agar plates 
and ampicillin agar plates at the same time. The amount of 
5 tetracycline revertants V7ere counted the iollov;ing day 

after incubation in a 37°C incubator for 14 hours. 

The tetracyclin recombinants were then plated for 
separate colonies onto new tetracyclin plates. Separate 
colonies were then inoculated into liquid cultures with Ix 
10 T3-media (Terrific Broth; Molecular cloning; A LABORATORY 

MANUAL, Second Edition, Cold Spring Harbor Laboratory 
Press ; 1989) with 1% Glucose and both ampicillin and 
tetracycline with the above concentrations and grown for 
plasmid-preparations with Qiagen Plasmid Midi Kit (Qiagen 
15 Inc., Chatsworth, CA, USA). Glycerol stocks of these 

overnight cultures were prepared by mixing 500^1 of 
bacterial culture with 215pil of 50% Glycerol and storing 
these mixtures at -80°C. 

A bacterial PGR- screening with the two external 
20 primers mentioned above of 40 of the tetracycline- sensitive 

colonies was performed to estimate the frequency of empty 
religated vector among these t ransf ormants . This was done 
with the PCR-mixture mentioned previously scaled down to 
25^1 reactions. These were inoculated v;ith one sensitive 
25 bacterial colony each and the PCR-profile was as above for 

30 cycles. The resulting PGR- f ragminets were analysed on gel 
as described above . 



FIND-eacperiment I : 



No. of amp. -resistant rlND- 
transfcrrr.ants 


Nc. of tet -resistant FiND- 
t r a n s f c rma n t s 


122 




^ requenc\' cf recombinant^: 1 c 


No. cf amp . -resistani reiic. 
s e n s i 1 1 e v e c t c r 
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100 


0 


Frequency of recombinants: 0 


% 


FIND-eacpeiriment II : 




No. of amp . -resistant FIND- 
transf ormant s 


No. of tet-resistant FIND- 
transf ormant s 


397 


22 


Frequency of recombinants: 5 


.Jo 



2 out of 40 bacterially PCR-screened sensitive clones 
were empty religated vector. This would then make up 5% of 
15 the total number of transf ormants . Therefore, 20 out of 397 

is empty vector. This increased the number of recombinants 
to 5.8%. 



FIND EXPERIMENT III: 

20 The FIND procedure is not restricted to the usage on 

tetracycline genes , but can be applied to any type of 
genes encoding a protein or protein motif. This is 
exemplified by creating a nev; repetoir of antibody 
fragments with mutations evenly spread over the entire 
25 antibody variable genes after FIND treatment. 

Single base pair mutations were introduced into the 
VL and VH-regions of the anti-FITC scFv antibody fragment 
Bll (PCoba-yashi et al . , Biotechniques 1997 Sep ; 2 3 ( 3 ) : 5 0 0- 
503) by the use of error prone PGR in accordance with 
30 rCuipers et al., (Nucleic Acids Res 1991 Aug 

25 ; 19 (16) : 4558 ) except for a raise in the MqCl. 
concentration from 2miyi ro 5ml^ . This antx FITC scFv 
antibody fragment was constructed by the use of overlap 
extension PGR, and zhe overlap extension procedure has 
35 previoulsy been used for the random combination of DNA 
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;3oderlind ez al. Gene 1995 ^ul 2c; 160 C 



The rTiUiared products were "hen subjecred no 
centrclled degradaricn v;iLh BAL31 exonuclease v:hich can 
5 be used for removing nucleotides from ohe oermini of 

acuble soranded DNA in a controlled manner. It is 
oredominant ly a 3* exonuclease (Sambrock eo al., 
Sam.brook, J., Fritsch E.F. and Mantiatis T. Kiolecular 
Clonmg-a laboratory Manual Cold Spring Harbor Laboratory 
10 Press, 2nd edition, 1989) and removes mono nucleosides 

from both 3' termini of the two strands of linear DNA. In 
addition, it also acts as an endonuclease degrading the 
ss 2'NA Generated bv the exonuclease activity. Degradation 
is completely dependent on the presence of calcium and 
;;L5 the reaction can be stopped at different stages by adding 

the calcium chelating agent EGTA. 3al31 works 
asynchronously on a pool of DNA molecules, generating a 
population of DNA of different sizes whose termini have 
been diaested to various extents and whose single 
20 stranded DNA tails vary in length. DNA of interest is 

digested with BA.L31 and samples are withdrawn at 
different times and placed in a solution with EGTA, whicn 
does not interfere with the activity of Taq polym.erase. 
Thus, ?CR based reassembly is possible directly after the 
25 digestion procedure. The average length of single- 

stranded tails created by digestion of linear ds DNA is 
depenaent both on time of Bal31 treatment and the enzyme 
concentration. High enzym.e concentrations of 2-5 U/rrJ 
yields an average of 5 nucleotides of ssDNA per terrr;inus, 
30 whereas 0.1-0.2 U/m.l can yield longer ssDNA. 

After the oreatm.eno c^f BA.L31, the pool of 



f r a cm.e n t s of v a r y i n g sizes, w n i c n w^ e r e 



reassembled as previously described into full length scF' 
genes. The resulting genes were cloned into the pnagerr^io 
vectir cFXmideS and the resulting library size after 
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transf ormar ion was 5.7x10" cfu/pg DNA, 

Single clones from the library were sequenced to 
estimate the genetic variability in the library. The 
frequencies of mutations found, distributed over the 782 
5 bp long VL-VH-region of the scFv antibody ranged from 1- 

56 (Table 1) , This is a mutation rate ranging from 0.13% 
to 7.16%, whereas the mutation rate for error prone PGR 
has been reported to be 0.7% (Kuipers et al . , Nucleic 
Acids Res 1991 Aug 2 5 ; 1 9 ( 1 6 ) : 4 558 ) . This result 
10 demonstrates the effect of recombining mutations in a set 

of genes, resulting in a varied gene population which can 
be used in selections/ screening of proteins with new and 
altered functions . 

Reagents : 

7\mpliTaq® polymerase was purchased from Per kin-Elmer 
Corp.,. dNTPs from Boehringer Mannheim Biochemica 
(Mannheim, Germany) , and BAL 31 Nuclease from New England 
Biolabs Inc. (Beverly, USA). All restriction enzymes were 
purchased from Boehringer Mannheim Biochemica (Mannheim, 
Germany) . Ethidium bromide was purchased from Bio-Rad 
Laboratories (Bio-Rad Laboratories, Hercules, CA, USA) . T4 
DNA Ligase was purchased from Boehringer Mannheim 
Biochemica (Mannheim, Germany) . 

Prxmexs : 

All primers were designed and synthesised at the 
department with a Applied Biosystems 391 DNA-synthesiser . 
The restriction sites introduced in each primer are 
underlined. 

Reamplif icatxon Primers : 

For error prone PGR and reampli f icat ion PGR after 
Bal3 1 treatment : 
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-Termer DL : riTC-bll-VL3' -FLAG-SAL 1: 

.-^I:;: /-.rpfy nn,^m np Qrjl Q QA. C T "J" li l'"' Z^_^T Q 21 CAT 

:AC CTA GGA CCG TCA GCT TGGT -3' (SEQ IG #1! 



10 



5' -primer DL:FITC 511-VH-5' Ncol : 

5 ' - ACT CGG GGC CCA ACC GGC CAT GGG GGA GGT GGA GCT GT 
GGA G -3' (SEQ ID #11) 

Sequencing Primers : 

Seauencing reversed pEXmide 4 : 5 ' -GGA GAG CCA CCG CCA 
TAA C-3' (SEQ ID #12) 



15 



pUC/M 13 reversed primer: 5 ' -TCA GAG AGG AAA GAG CTA TGA 
C-3' (SEQ ID #13) 



Plasmids 

pEXmide V: 4055 bp Ncol- and Sail -sites are marked with 
20 underline text is shown in Figure 8. 

Error Prone PGR: 

The error prone PGR reactions were carried out in a 
10 X buffer containing 500 mIM NaCi, 100 mM Tris-HCl, pH 
25 8.8, 5mM MgCl- 100 uq gelatine (according to Kuipers et 

al Nucleic Acids Res. 1991, Aug 25;19 (16}:4558) except 
for a raise in rhe MgCl_, concentration from 2 rnl^, ro 5 



For each 100 reactic' 
dATF 5 mM 5 ul 
dGTF 5 mM 5 ul 
dTT? 10 zrM 10 ^'1 
dCT? 10 rvM 10 .^1 



f ollov:inc was rrLixec: 
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20 yuM 3' primer 1.5/ul 
20 5'-primer 1.5 /ul 

lOx Kuipers buffer 10 /^l 
sterile mp H2O 46.3 /^l 

The template scFv FITC Bll in pEXmideV vector (24.5 
ng/yil) was added at an amount of 42 ng, 10 pi of 10 iniyi 
MnCl2 was added and the tube was checked that no 
precipitation of Mn02 occurred. At last 5 Units of Taq 
enzyme was added. The error prone PGR was run at the 
following temperatures for 25 cycles without a hot start: 
94°C 1', 45 °C 1', 72 ""C 1' , using a 1 second ramp time, 
followed by a rapid cooling to 4°C. The resulting product 
was an error proned insert over the scFv FITC of 782 bp. 
This insert was purified with Qiaqucik PGR purification 
kit, before BAL 31 Nuclease treatment. 

BAIj3 1 Treatment : 

Error proned purified insert of the FITG B1 1 was 
digested with 0.5 U BAL 31 enzyme/pg insert DNA. 1.5 ml 
sterile eppendorf tubes with DNA, 2x BAL31 Nuclease 
buffer and water were pre-incubated in 30°C for 10 
minutes. After this pre-incubation, BAL31 Nuclease was 
added except for one control tube to a final 
concentration of 0.5 Unit/pg of DNA. The control tube, 
thus, contained only DNA buffer and water. After t= 2', 
4' , 6' , 8/ and finally 10 minutes, the tube was mixed and 
samples were removed and added to the tubes with EGTA and 
placed on ice. The working concentration of EGTA was 
20mIM. At the same time the control volume was removed 
from the water bath and this sample was also mixed with 
EGTA and placed on ice. The tubes were then placed in a 
65°C water-bath for heat inactivation of the enzyme and 
then replaced onto ice. 
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Reassembly of BALBl generated fragments: 

The reassembly of the generared fragmeri^ pools v;ere 
o e r f o rme d as r:* r e v i o u s 1 y d e s o r :i be d in o v: o s u b s e q u e r. o ? C R 
reacooons. The first ?CR reaction was performed v;iohoot 
the addition of any external primers cy mixing equal 
am.ounos of the cifferent time pools in a soandard PGR 
reaction. The ?CR reaction was run at 40 cycles 
consisting of following profile: denaturation (94''C for 1 
minute), primer annealing (55'^C for 1 m.inute) and 
extension (72°C for 1 minute) using a 1 second ramp time. 
The PGR reacoions contained, unless otherwise noted, 5 ul 
of each primer ^20 uM), 16 ul of a dNTP mixture (1.25 mJ^l 
each of dTTP, dATP, dGTP and dGTP) , 10 ul lOx reaction 
buffer supplied with the enzyme, 0.5 pi Am.pliTaq® 
thermostable DUA polymerase (5U/ul) ( Per kin-Elmer Corp.) 
and water to a final volume of 100 pi . 

The reassembled products were then reamplified v/ith 
a PGR containing the 3^- and 5 ' -external primers to 
generate an insert of the correct size and thereby also 
introducina the restriction sites Ncol and Sail for 
cloning into the pEXmideV vector. The PGR reaction was 
run at 25 cycles consisting of following profile: 
denaturation (94°G for 1 m.inute), primmer annealing (oS'^C 
for 1 mmuoe) and extension (72°G for 1 minute) using a 1 
second ramp timie. The PGR reactions contained, 5 pi of 
each prim.er ( 2 C' pM), 16 ul of a dl.^TP m.ixture (1.25 ml^l 
eacn of dTTP, dATP, dGTP and dGTP.*, 10 pi ICx reaction 
buffer supplied with the enzyme, 1.5 ul AmpliTaqvs; 
tnerm.ostable DNA polymerase (5U/pl) f Per kin-Elmer Gcrp.) 
and water to a final volume of 100 pi. The suosequenr 
insert was purifieo on a 2^ agarose gel using the 
i i a c u 1 c pel extraction k i o ( r' ob a y a s h i e t a 1 . , 
H 1 c- 1 e c h n i c u e s 19 9 7 Sep; 2 3 ( 3 ) : 5 j 0 - 5 0 3 ) , 



Cloning in the PEXMIDEV Phagmid Vector: 
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The insert and vector were digested with the Ncol 
and Sail enzymes from Boehringer Mannheim. The insert was 
cleaved with 10 U enzyme /pg DNA and vector with 4 U/pg 
DNA. The insert was then gel purified as described 
previously and the vector was purified using the Microcon 
100 micro concentrators (Amicon, Inc., Beverly, MA 01915, 
USA) . The vector was then cleaved with a third enzyme, 
the Pst I enzyme, who's restriction site is located in 
between the first two enzymes. The vector was gel 
purified with the Qiaquick gel extraction kit (Qiagen 
GmbH, Hilden, Germany) . Insert and purified vector were 
ligated with 25 U T4 DNA ligase/pg DNA (Boehringer 
Mannheim) at a vector to insert ratio of 590 ng vector 
to 240 ng insert {12:1 molar ratio) for 14 hours at 12''C. 
The ligation reactions were purified by phenol chloroform 
extraction and ethanol precipitation and subsequently 
transformed into electro competent Top 10 F' bacterial 
cells. The library size was determined to S.TxlO'' cfu/pg 
DNA. Glycerol stocks were produced after transformation 
according to J. Engberg et al (Molecular Biotechnology 
Vol 6, 1996 p287-310) and stored at -20°C. 

Sequencing : 

Separate colonies from the glycerol stock library were 
grown and plasmid preparations were performed with 
Promega Wizard Pius Minipreps DNA purification System 
(Promega,. Madison, WI USA). The VL and VH insert of these 
plasmids were amplified with a PGR containing the 3'- and 
5'-exrernal primers to generate an insert of the correct 
size. These inserts were then sequenced with Big Dye 
Dyedeoxy""" Terminator Cycle Sequencing Kit. The sequencing 
was performed on a AEI Prism 377 DNA Sequencer. 

Table 1 

Number of mutations in the 782 bp long scFv sequences 
after FIND treatment 
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Clone 


Number of Murarions 


1 








3 


8 


4 


23 


5 


50 


6 


o b 


7 


10 


8 


26 


9 


36 


10 


18 



15 
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Claims 

1. A method for generating a polynucleotide sequence or 
population of sequences from a parent polynucleor ide 
sequence encoding one or more protein motifs, comprising 
the steps of 

a) digesting the parent polynucleotide sequence with 
an exonuclease to generate a population of fragments; 

b) contacting said fragments with a template 
polynucleotide sequence under annealing conditions; 

c) amplifying the fragments that anneal to the 
template in step b) to generate at least one 
polynucleotide sequence encoding one or more protein 
motifs having altered characteristics as compared to the 
one or more protein motifs encoded by said parent 
polynucleotide . 

2. A method according to claim 1 wherein the parent 
polynucleotide is double-stranded and the method further 
comprises the step of generating single-stranded 
polynucleotide sequence from said double-stranded 
fragments prior to step b) . 

3. A method according to claim 1 or claim 2 wherein the 
template polynucleotide sequence is the parent 
polynucleotide sequence. 

4 . A method according to claim 3 wherein the parent 
polynucleotide sequence has been subjected to 
mutagenesis . 

5. A method according to any one of the preceding 
claims wherein the population of fragments generated in 
step b) are subjected to mutagenesis. 
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n H rp.'S tL n c ci c o ^ Lr. c c ^ ^ 2. rr 4 cir c ^ s — rr. o w n e ir b il n ti n s 

rr.uT:aaenes:is is error prone mooaoenesrs . 

7. A meohod accorcrng oo any z-ne of the preceding 
clairr.s v;herein rhe exonuclease is 3AL31. 

8. A merhod according to any one of the preceding 
claims wherein the pareno polynucleotide sequence encodes 
an antibody or fragment thereof. 

9. A method according to any one of claims 1 to 9 wherein 
the parent polynucleotide sequence encodes an enzyme. 

10. A method according to any one of the preceding 
claims further comprising the step of screening the at 
least one polynucleotide generated m step c) for desired 
characteristics . 

11. A inethod according to any oen of the preceding 

20 claims further comprising the step of expressing the at 

least one polynucleotide generated in step c) and 
screening the resulting polypeptide for desired 
characteristics . 

25 12. A method for preparing a pharmaceutical composition 

which comprises, following the identification of a 
polynucleotide v;ith desired characteristics by a m.ethod 
according to any one of claims 1 to 11, adding said 
polynucleotide to a pharmaoeut ically acceptable carrier. 



15 



30 



13. A metnod for preparing a pharmaceutical composition 
which comprises, following tne ideno i f i cat i on of a 
colvpeptide witn oesired cnar act er i st i cs by a methoa 



.-rdma 10 ciairr. 11, adding said pclypep-ide rc 
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14. A process which comprises, following the 
identification of a polynucleotide by a method of any one 
of claims 1 to 11, the manufacture of that 
polynucleotide, in whole or in part, optionally in 
conjunction with additional polynucleotide sequence. 

15. A process which comprises, following the 
identification of a polypeptide by a method according to 
claim 11, the manufacture of that polypeptide, in whole 
or in part, optionally in conjunction with additional 
polypeptides . 

16. A process according to claim 15 wherein the 
polypeptide is an antibody or fragment thereof. 

17. A process according to claim 15 wherein the 
polypeptide is an enzyme . 

18. A process which comprises, following the 
identification of a polynucleotide according to any one 
of claims 1 to 11, the use of that polynucleotide, in 
whole or in part, optionally in conjunction with 
additional polynucleotide sequence, in therapy or 
diagnosis,, or in the detection and/or amplification of a 
target polynucleor ide in a sample. 

19. A process which comprises, following the 
identification of a polypeptide according to claim 11, 
the use of that polypeptide, in whole or in part, in a 
method of treatment. 

20. A process which comprises, following identification 
of a polypeptide according to claim 11, the use of that 
polypeptide, in the manufacture of a therapeutic or in 
vivo diagnostic agenr. 
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21. A process a c c o r d i n c: to c 1 a iiPt 19 or claim 20 wh e r e i n 
zhe o c 1 vo e "ozLde i. s an a n ^ 1 b o d or f r a gme no o n e r e o f . 

22. A o-rocess aoco'rdlng to claix 19 or claim 20 v:herein 
t:he Dclypepoode is an enzyme. 

23. ?. orocess according to claim 19 or claomi 20 wherein 
the polypeptide is an antigen. 

24. A kit for use rn the method according to any one of 
claims 1 to 11 . 

25. A kit comprising an exonuclease and instructions fo 
carrying out the method according to any one of claims 1 
to 11. 

26. A kit according to claim 25 wherein the exonuclease 



wo 98/58080 



PCT/GB98/01757 



1/9 



LEGEND 



Tetracycline resistant 



Tetracycline sensitive 



Tetracycline sensitive 



Mix clones 



0 



(^Digest with Bal 3l) 



CReossembly with PCR) 

Fig,l 



SUBSTITUTE SHEET (RULE 26) 



N5DOC1D -rwo ^eseoeoAi 



w o 98 58080 



I*CT C;B^)8 01"5' 



2/9 

Fragment from BAL31 treatment: a-f different 
mutated gene fragment. 

Template: BAL31 untreated full 
. length gene, mutated or not. 



^ PGR elongation, one cycle 



^ PGR elongation, final cycle 




PGR Amplification with external 
primers 



^ Resulting gene with genetic recombination 




Fig. 2 
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Ikb 2m. Am 6m. Sm. 10m. untreated | 



3054 bp. 
2035 bp. 
1635 bp 
1018 bp 

505 bp 
395 bp 

220 bp 




Fig. 5 
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A 

CTAGCGCTATATGCGTTGATGCA/^.TTTCTATGAGCACCCGTTCTCGGAGCACTGTCC 
GACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTACTTGGAGCCACTATCGAC 
TACGCGATCATGGCGACCACACCCGTCCTGTGGATCCTCTACGCCGGACGCATCGTG 
GCCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCCGACATCACC 
.-ATGGGGA-AGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGTTTCGGCGTGGGT 
ATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCTCCTTGCATGCACCA 
TTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCTAATG 
CAGGAGTCGCATAAGGGAGAGCGTCGACCGATGCCCTTGAGAGCCTTCAACZCAGTC 
AGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTC 
TTTATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAG 
GZ\CCGCTTTCGCTGGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGG.CuZlTC 
TTGCACGCCCTCGCTCAAGCCTTCGTCACTGGTCCCGCCACCAAZ\CGTTTCGGCGAG 

AAGCAGGCCATTATCGCCGGCATGGC 



B 

GAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCCTCTACG 
CCGGACGCATCGTGGCCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATA 
TCGCCGACATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTT 
GTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCT 
CCTTGCATGCACCATTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGG 
GCTGATTCCTAATGCAGGAGTCGCATAAGGGAGAGCG 



Fig. 6 
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A 



CCGTTNAAGNNNACACAGTTANATTGTTAANGCAGTCAGGCACCGTGTATGAAATC 

TAACTVATGCGCTCATCGTCATCCTCGGNACCGTCACCCTGGATGTTGTAGGCATAG 

GCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGN 

ATCGCCAGTCACTATGGNGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTAIG. 

AGCACCCGTTCTCGGAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCG 

CTTCGCTACTTGGAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTG 

TGGATCCTCTACGCCGGACGAATCGATGGCCGGAATCACCGGGGTCACAGGTGCG 

GNTGCTGGNGCCTATTTCGCCGACATCAACGATGGGGAAAGATGNGGCTCGNCAC 

TNCGGGCTCATNAGNNTTTGGTTTCGGCNTGGGTATTGGTNGGAAGNCCCCCANG 

GCCGGGGGGATTGTTNGNGNGCCAACTTCCTTGGATTGAACAATNCCCTNGGGGG 

GGGGGGGTTCANCNGGCHCAACCTATTNNTGGGATTNTTNCNNATNNAGAGTCGA 

TAAGGAGGNGNNGGCCT^^TCCNTGNAGCCCACCC 



CAGTATGACCATNNNCTAGCTTCTCGNCGAGACGTTTGGTNGCNGGACCAGTTAC 

GAAGGCTTGAGCNAGGGAGTTGAAGATTCCNTATACTNAATGNGATAGGNCTATC 

ATCGGNGGGCTCCANAGATAGCGGNCANCGNCNACANATGACCCAGAGCTNTGC 

CGGCANCAGTCCTACGAGTNGNATGATNAAGTAGANAGGCATAATTGGGGNGACG 

ATAGTCATGNCCCGCGGCCACCGGAAGGAGCTTAATGGGTTGNNGGCTCTCAAGG 

GCATCGGTCGACGCTCTCCCTTATGTGACTCNTGNATTAGGAAXCAGCCCAGTTNG 

CTAGGTTTGNGGCCGNTTGNAANCAACCCCCGNCCNNAxNAGGGAATTGNTGNAAT 

NNAAAGGGNGTTTGGGNGNCCCAACAAGTCCCCCCCGNGCNANNGGGGGCCCTC 

CCACCAATTNCCCCACGGCCGAAAAAAAT^GTTTTCAATNAAGCCCCNAGGTNGG 

GGAACCCCTNTTCTTCCCCCATCGGNGGANATTTGGNTGAATTTTTGGGGNCCA7VN 

ANNCCCNNCTTTNGGGTCCGNTNTTATNTCCCNCCCACAATTNNTTCCCGTTTNGG 

GGNNNNNTCCN/VANGAAGGTTTTNTTTCCCCCCCNATTTCCNCTTTATNCNNTTTN 

TNNTTTNNNNATAGAAAAANAAAANTTTGGGGGNGCCAAGGTTTNATAATATTT 
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as^QC^TG^A-^Gf^r^JiATTCTATTTCAAGGAGAZAGTCATAATGAAATACCTATTGCCTACGGC 
AS^^G'^'^GGAT'^'^-TT^'jiT'^.'^CTCGCGGCCCAACCGGCi^AXQS 

CG^GCrrTGrAGG^TAGCACTAGTGGTACC£T£iA£AAGAAAGTTGAGCCCAAATCTTGAAC 

T-z^^GA^GCArAr^TCAGGAGGTTAGGGTGGCGGTGGCTCTCCATTCGTTTGTGAATATCAAG 

r-r^i^TCGT'^TGAr^rTGCCTCAACCTCCTGTCAATGCTGGCGGCGGCTCTGGTGGTGGTTCT 

SGTGG-GGrrcTGAGGGTGGTGGCTCTGAGGGTGGCGGTTCTGAGGGTGGCGGCTCTGAGGG 

AGGrGGT-TCrGGTGGTGGCTCTGGTTCCGGTGATTTTGATTATGAAAAGATGGCAAACGCTA 

AT^GGGGGrTATGACCG-AAAATGCCGATGAAAACGCGCTACAGTCTGACGCTAAAGGCAAA 

CTTGATTCTGTCGCTACTGATTACGGTGCTGCTATCGATGGTTTCATTGGTGAGGTTTCCGG 

C-TTG^TAATGGTAATGGTGCTACTGGTGATTTTGCTGGCTCTAATTCCC-^AATGGCTCAAG 

T^GGTGACGGTGATAATTCACCTTTAATGAATAATTTCCGTCAATATTTACCTTCCCTCCCT 

CAATrGGTTGAATGTCGCCCTTTTGTCTTTAGCGCTGGTAAACCATATGAATTTTCTATTGA 

TTGTGArAAAATAAACTTATTCCGTGGTGTCTTTGCGTTTCTTTTATATGTTGCCACCTTTA 

TGT^TGTATTTTCTACGTTTGCTAACATACTGCGTAATAAGGAGTCTTAATAAGGGAGCTTG 

CATGrAAAT-^CTATTTrAAGGAGACAGTCAT/VATGT^TACCTATTGCCTACGGCAGCCGCT 

GGATTGTTATTACTGAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAZy^CCCTGG 

CGTT^^'^r-AArTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAG 

AGG'-'CCGCACi^GATCGi^CCTTCCCA^CAGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATG 

CG-TATTTT^TCCTTA^GCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAG 

TACGrGrCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGC 

?A^A^?^GCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGT 

TCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCT 

TTACGG^ACCTCGACCCCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCC 

CTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTC 

TnnAAArTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTT.^ 

CCGATTTCGGCCTATTGGTTAAA-AAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAA 

C^?A??;^CGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGAT^ 

AGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGA. GGGCTTG^G^ 

crGGCATCCGCTTACAGACi^JVGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTC 

k?rc5cI5cACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATA^^ 

ATG-rATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCb._GGA 

A'-CC'^TATTTGTTTATTTTTCT.W.TACATTCAAATATGTATCCGCTCATGA3ACAATAA<^C 

.-TGATAAATG^TTCA^'.TJ^ATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCG 

r-^CTTATTrCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGA 

AAGTAAAAGAT-GCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAAC 

iGCG^SATCCTTGAGAGTTTTCGCGCCGAAGAACGTTTTCCAATGATGAG^ 

AGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAALTCuGTC'^CC 

rrA-Ar-ACTATTr-TCAG.AJ^Tr-ACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACG 

G^TGGrz^TGArAGTA.A3AGA.-.TTATGCAGTGCTGCCATAACCATGAGTGATAZvCACTGCGGC 

^A.^^TT";\CT'^rTr-z\CAArGATrGGAGGACCGAAGGAGCTA^CCGCTTTTTTGCACAACATGG 

,r.„,." ;;^(3^^z^^TrcrrTT(-:ATCGTTGGGA7-'.CCGGAGCTGAATGAAGCCATACCZ-vA.aCGAG 

"AGrGf-\-^rrAr^GATGGCTGTAGCWGGCAACAACGTTGCGa^CTATTA7^XTGGCi 

- ^^--^,r^;:^ z^^r-^.-r.-^-GG V\CA.i^tTA.ATAGACTGGATGGAGGCGGATAAAGTTGCAG 
■^^,-,-^'-^;^;:^,:^'^^^PP„.3^rcTTCCGGCTGGCTGGTTTATTGCTGATAAB.TCTGGAGCC'^.GT 
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GAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGT 

AGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGA 

TAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAG 

ATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCT 

CATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGA 

TCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAA 

CCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGT 

AACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCC 

ACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 

GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGA 

TAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGA 

CCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGG 

AGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCT 

TCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGC 

GTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCC 

TTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCC 

TGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAA 

CGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCT 

CTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGC 

GGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTAC 

ACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGA 

AACAGCTATGACCATGATTACGCC 
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TetracycLine sensitive 



: Tetracycline sensitive 



(Javk clonesj 



(Digest^ 
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Fragment from BAL31 treatment: a-f diffarent 
mutated gone fragment. 

Template: BAL31 untreated full 



^ length gerte, mutated or not 



^ PCR eiongation, one cycle 



^ PCR elongation, final cycle 




I PCR Amplification with external 
▼ prlmer« 



^ Resulting gene with genetic recombination 



Fig, 2 
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Centre 



FIND 



Cent re 
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Centre 



TerrplG^e 



FIND 



Centre 



^ — » » 

» » > > » > 




< ^ 4 ^ Templat e 
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1 kb 2m. Am 6m. 8m. 1Clm. urttreated 



2D36bp 
1636 bp- 
1018 bp- 

505 bp J 
396 bp- 

220 bp 
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A 



GJiCCGCrTTG<;CCGCCGCCC:AGTCCT<3"2'rCGCrTCGCTACTTGGAG 
T'ACGCGL2iTQ?\TGGCG7'LCO^CACCCGTCCTGTG'iATCCT 

G^LTGGGGAAGATCGGGCTCGCCACTTCGGGCTCArGAGOGCTTGTTTCOGCGTGQG'r 

TTCCTrGCGGCGGC-GGT0L':TCAACG'3CCT':^^^ 
CAGGAGTCGCATx^AGGGAGAGadl'CGACCGMGCCCTTa^^^ 
A.iaCTCCTTCCGGTGGGCGCGGGGCJtTGACTA^'CGTCGCCGCAC 
TrTATCATGCAACTCGTAGGACAGGrGCCGGGAGCv^C'TCTC-fiG 
G7iCCGCTTTCGCTGGAGC0CGACG ATGAT CGG CCTGTCGCTTGCGGT ATT 
TrGCAOSCCCTCG CrCAAG CCTTO3TC?J:-TGGTCjCCG0CJ^J^2\ZA^ cggcgag 
AAGCAGGCCJiTTATCG CCGGCAT'^GC 



aii.G CriStC'T Al*Ci2ACTACGCiGATCArGGC 

CCG::-ACGCATCGTGGCCGG'::ATCACCGGCGCirACAGGTGCGt^ 

T'CGCCGACAT-CACGGATGGGGAAGATCGGGCTCGCCLitCTTCGGGC^^ 

GrTTCGGCCHTGGijrATGGTGGCAGGCCCCGTGGCCGGGGatCTGTTG&^ 

CCTTGCj^^.TGCAO^ArTCCTTGCGGCGOC^GTGCTCx^Lf'.CGGCCTCA^ 

GCTGATrcCTAATGCAGGj^GTCGCx-.TA/'jL^iSG 



B 
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A 



CCGTT>L^J'.C^^'UONACflCi?iA">rTAl'IATrGrTAAi:')GCAG 

TAAC?.A'rGCG GTrlA.rC:GTCAT CCT'CGGNACCCnT C.A CCCTGCATCTrGTAL-GanrAG 

GCTT GGTTAT GCCGGTACTGCC'jGGCCTCTTtaC GGGATJvTCGTCCATTvi-CGACA^ 

7^.TCGCCAGTCACTATGGNGTGCTGirTAGCGCTATATi.jCCT'rioArGCAATTTCTx^I^ 

AGCAC/JCGrTCT''2(3GAGC7iCTGTCCC^ACCrJCirTTG13CCGCGGCC':l^^^ 

CTT Cio CT ACTTGG AGCCACTATCG AC'IACG CG AT iZAT GGCGAC'CiACACCCGT CCTG 

'rGGATCCTCTACijCCGGACf?LAATi::GATGGCCGGAArCACa5G<3 

GNTGCrGGN'-X^CTATTTCGCCGACATCAACGAI^GC^GC-AAAG^LT.^ 

T^^CGGGCTCAT^L^GN^JTTTGG'^■^'TCGGCJ■■]TGGGTATTGGTNl3•:I;AAG^ 

GCCGGGGGGATTGl'TNGMGWGCCAACTTCCTTGGATTioPACAATHCCGTHGG 

GG■3l:y^l3•GTTC;i>ICNGGCHCAACCTP.TTNNTGGGATTfiTTNCH^^ATW^^Al3AGTCG^^^ 

TA?iG:GAGG10GWHv;CCCA>JrCCHTGN7lGCCCACCC 



CAGrATGACCATNl'-nJCTAGCTT CT CGKCG.AGACGT^.''rGGTWGCNGGACCAGTTAC 
GAAGGCTTGAGCR^GGiSAGTTGAACnATTCCNTATACTNA/^TGNaATAOGNCTATC 
ArCGGti3GGCTCCAl^j<GATAGCGGNCAK'CCiWCWACW4ATGACC 
^GGCA>ICAGT CCTACGAGTNGNATG AThy^AGTAGAR^-SG CATA^LTTGCGGWi^ 
AT AGTC AT■G>^:^^: r^GCGGCC ACCGGi>Ji/GG A'3CTTAATGGGTTGNNGG CTCJiAGG 
Ga'VTCGGTCGACGCTCTGCCTa'Al'GTaiiCTCHTGR^.TTAGviA^O^ 
GTAGGTTTGr?i::iGCCG WTTGtv]j\Al^CA-ACC CCCG^^:::CNNAl•vAGGGA?^ 
«NA^AGGGN';?TrTGGGWGrJCCCA.ACA,^X-'i'CCCCCCCGNGCNAl^ 
CCACasJi^TTOCCCCA.CGGCCGAAA^'-AAAJU'iGTTTrCxA.^^^ 

GGrn'nsV:JWTCaWd'TaA?.GGTTT7l3TTTCCCCCCCHATTTCClOCXTTATHCNHT'rTN 
rbTOTTTEtnOMATAGAru=.AAl^jiJLWJTTTGGGGGl'?G^ 
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A? Gr^TOC^GCJ'AKI TCI' ATT T C AA.GGAG AC A-G"! CTl.TT'ATG AA AT ACCT ATTGCCTACGGC 

C:"-GCTGGATTG I'TATTACTCGGGGCC CAACCGGjZiCfi^mi^ 
r-G r,(-c-:fM-:TG CAnGCTAG C ACT AGTGGT A l:;C^aI£G^A^a?AAGTTGACiCCC/'AATO^T CJJy: 
T,=^'SACGai\rACJ\T C>/:i GAGGTTAGGGT GGCGGTCGCTCTCCATTCGTTTG-rfi^^^ 
.■^C-r.AAT CGTCT G ACCTGC.CT':;AAC.CTC CTGT CAATGCTGGCGG CGGCTCTGGTGGTGGTT CT 
.'^ GT &:^CGGCTf"T r'-lAGGGT GGTG GCT CTG7iGGGTGG CGGTTCrGAGGGrGGOGGCT CTG-^lGGG 
KGnCGQT T C^-GGTGGT:^i jCTCT GGTTCCJSaTGATTTTGATTATGfJ-AAGATGG aWiCGCTA 
ATATiG^-^GGGCTArGACCGAAAATOCCGATG/iAAACXSCGCTACAGTCTGAO^CT.^^^^^ 
CTTGP.r TCTGTCG CTACTGATTACGGTG CTGCTATCGJlTGGTTT CATTGGTGACGTTTCCGG 
CCTTnCTAATGGT.^ATGGTGCTACTGGT'SATTTTGCTGGCTC'rA^.TTCCCAPJ^rGGCrCj^Ji 
T PGGTG ACG f^TG AT A7'lTTCACCT TT/U\TG A?xTA».XTTC:CGTCAAT ATTX ACCTT CCUTCCCT 
■■^AATCGGTTGAATGT CG Cr^CTTTT GTCT'rTAGCGCTGGTAAaX-CATATG;^TTTl'CTATT'3,'x 
TT GTGAC AAAAT AAa GT TATT C GtfTGGTGTCTTTGCGTTTCTTTT ATATGTTGCC ACCt 'I TA 
TGT 6JG7 ATTTT CT ACGTTTG CTA7tCATACTGCX^rAATA;^I3a^GTCTrAAT7'AGGa^GCTTG 
r:ATGCiW.TTCTA'rTTCA^GGAGACAGTCAT2V\TGA?\ATfJr;CTATTGCCTACGGt:^iGCCGC4^ 
GGATT GT T ATT ACT'G^ji.TT Ctl.CTGGCCGTCGrTTTACAACGTCGTC3ACTGGGA?JflA^ 
CGTTAC CCAACri AAT CGCCTT'2 CAG C7'j:ATCCCCCTTTC?GGCAGCTG<3C.-:^?TJvW.GCtiAAi3 
^GGCC ;f :GCAC CGAT CGC CCT T'CCCA/iCAGTTGCG CAGCCTGAATGGa5.6ArGGC-GCCTGATG 
OGGT: AT TTT CT CCT TACG CAT CTGTGCGGTATTT C ACACCGCATACGTCA^ AGCAJiCCATAG 
TAC G CGCCCTGT AGCGGGG CArTAAGCGCGGCG<3GTGTG'3TGGTT AOGCGCAGCGTG ACCGC 
TAG J\CTT G CCAGCGCCCTA<3CG C CCGCrCCTTTCG CTTT CTTCOSTTCCrTTCTCG-::CACGT 
T r'Gi--CGGrTTTCr.i":CG TCAAGCr CTAA.JVTCGGGGGCTCCCTTTAGG'GTTCCGArTTAGTGCT 
TTAGGGCACr::TC.GACCCJJjfW^^J^^ACTTGATTTGG3T-G*.TGGTTCACGTAGTGe^ 
r^'S^iTAGAr^GGTT'fTTCGriCCTTTCiACGTTGGAGTCClACGTTCTT'TAATAGTGGAC'rCTTGr 
T '■"^■'•"3 ' " AJi a^A HA r.TCAACCCTATCTOoGGCTATTCTTTTGATTT AT AAGGGATTTTG 

CCG ATT T CGG C CTArTGGTTA-=>J\ A^JiTa^GCTGATTT AZkCJiA\A!kTTTAAa3U.?AA 
CA=AATATTAji.CGTTrACAATTTTATG3TriCACT''ZTCAGTACAi^.TCTGCT':LrrGATGCC 
A r-^T TAAG CCAGf nCGACACCCGCC.!>-ACACCCGCT-C;A.CGCGCCCTGAOGGGCTTGTCTGCTC 
C rr.GrA TCCGCTTACJ1.GACAAGCTGTGACCG5- CTCD3GGAGGTGC ATGTGTCAGAGGrTTTC 
/j-.L:GTaATC7v3CG.!y^ACGCGCa'iGACrJAV^GGGCCT03:rGATACGCCTATTTTTAlA.GGTTA 
PlT GTC ATCiAT AAT^TGGTTT CTT AGACGTCAGGTGGC?.CTTt'J-CGGGGA^ ATGTGC/vCGGA 
Ar:r\CCTATTTGTTTAtt'TTTTCrAJ*Al:'AGATTCA^.TATGTATCCGCTCATGAGAC^AATAi\CC 
CTi"TATAriATGCTTCA!^TJLATATTGA.aAAAGG?AjGAG'l\VTGAGTATTCAAavTTTC 
r C^TT7- TT CCTT TTTT G HGG CATTTTGCCTTCCTGTTTTTGGTC^/CCCAGA'iA.OGCTGGraz!. 
AAGTAA.AAGATGCTG AACAT C AGTTGGGTC:iG ACG/iGTGGGTTACATCGAACT GGATCT CAAC 
AGCGGTAAGATCCTT(3AGAGrTTTCGCCCCGAAGiiJiA:GTTTTCCAATGjT\TGAGCACTT 
AGT-TCT G CTATGTGG CC>CGGT ATTATCOr^GTATTG ACGCOSGGG AAGAaC-WvCrCGGTCGC C 
r-^f-i,rArjiCTAlTnTC7vGAJixTGACTTGGTTGAGTACTCACCAGTCA.'::;A'G?^^ 
r.}^ T G GCATG Aa\GT A=.[i AG AJlT T ATG Cfti^'T GCTGCC AT AACCAT G AGTG ATATLa^CTGCGGC 
r:AACTT74CTTCTGAC!^CGATCGGAG:GACCGA\r.v3AGCrA7-.CCG 

GGGAT CAT GT A\CT CGC '^TT GAT CGTTGGaa ACCGGAGCTGAATi.tA^GCCATACC A^ACGAC 
G A^^CGTGAQACCACGAT GCCTGTAG'LjUi-'rGGCA/^CAACGirf GCGCJ^^^^ 

ACT Ji TTT Af"T CT AG CTT C CCG GCAACj w\TT J-AT AG ACTiG^^ ATGG AGiiCGG AT AGTTG CA:.^ 
G J^CC ACTT-TG CGCT CGG C CC TTCC.GGCTGGCTGGTTTArTeCTGATAA?vTCTGGASCC.GGT 
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lAG>"Tr. CC T CACT r3ATT A7iGCATTGGTAAi::TGTCAGACC?,AGTTTACT C1\TAT7l? ACTTTAG 
AT T'^iATTT/iA^^-'iC TTOATTTT V AATTT7vJL='JiJ3'5LB.TCTAGGTGf 
.CA^GAC■CAA?■A:l■■CCCTT.Z'u^CGTGAGTTTT03TTC■a^'/w 

Ti3/,tjiGf3/TCTTCTTGAGATCCTTTTTTTCTGCiX:GTAATCTGC:TGCTTGCAA"^^^ 

^ft^TGGr•TTa=^Gt*:AGA^^C0CAGATACCAftATACTGTCCTTCTAGTGT7LGCCGTAGTTAGGCC 

Xc'^ i^rTT-rA AGAi..CTCTGTAG C ACa^CCT ACATi=vCi:"J CG CT CTGCTJ^ATCCnTGTT^LCCAGTG 
G CTGC TGCC AGT GGCGM' AAGr CGTGT CTT ACCGGGTTGG ACTCAA'lACGATAGTTACa^GJk, 

T AAGGr.Gr;A r-l nf^GT CC-GGCTGAACGOiGGGGTT CGTG CACACAGCCCAGCTTGGAGCGM^CGA 
CCTACACCG/i.ACTGAGA^^ACCTACAGCGTGAGCTAT^G.\a^t^^^^ 

GG n^i'^An AGGT?lTCCGGT A AGCGGCAGGGTCGG AAC AGGAGAGCr:;-i:: ACG AG'Z.G AGCT 
rCCAt3GGG<GA.:iACGCCTGGTATCTTTATAGrCCTGT03GGTTTC'3CCAC:CTCTGACTl':3AGC 
GT■;"GATTTTTr^»T^3ATGCTCGTCAGGGGGGCG^GAGCC^'A'lGGAA?iA?.CGCCAGCAACGCGGCC 
T TTTr A^^'GGTTCCTGGCCTTT TGCTGGCCTTTTGCrCACATGTT CTTTCCTGCGTTATCCCC 
TGATTCTGTGGAT.^ACCGT/JTACCGCCTTTGAGTGAGC'JaATACCGCTCGCCGOAGCCGAA 
CGAnCGAC5CGr;AGCGAGTCAGTGAG03AGGAAG':^iGJ'AGAGCGCOClftAr^ 
CTCCCCGnGCGTTGGCCa^^TTCATTi^A.TGCAGCTGGCACaAas.GGT 
GGGrAr^y':JAGCGCAACG-.CAATTAATGTGA'3TTAGCTCACTCATTA0Gr2ftCCCC 
ACTTTATGCTTCCGGCTCC-iTATGTTGTGTGGAATTGTaAGOSGATAJ'LCAxATTrCACAiZJrvG'GA 

aj^CAGCTATt^ACCATi'^ATTACGCC 
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